Logarithmic Fluctuations for Internal DLA 

David Jerison* Lionel Levine* Scott Sheffield* 
July 6, 2011 



Abstract 

Let each of n particles starting at the origin in 1? perform simple random 
walk until reaching a site with no other particles. Lawler, Bramson, and Grif- 
feath proved that the resulting random set A(n) of n occupied sites is (with 
high probability) close to a disk B r of radius r = \pnpn. We show that the 
discrepancy between A(n) and the disk is at most logarithmic in the radius: 
i.e., there is an absolute constant C such that with probability 1, 

B r _ciogr C A{-Kr 2 ) C B r+ cio g r for all sufficiently large r. 
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1 Introduction 

In the process known as internal diffusion limited aggregation (IDLA) one constructs 
for each integer time n > an occupied set A(n) C Z 2 as follows: begin with 
A(l) = {0}, and for each n let A(n + 1) be A(n) plus the first point at which a 
simple random walk from the origin hits Z 2 \ A(n). 

IDLA was first proposed by Meakin and Deutch [MD86] as a model of industrial 
chemical processes such as electropolishing, corrosion and etching. After discussing 
these applications, they write "it is also of some fundamental significance to know 
just how smooth a surface formed by diffusion limited processes may be." Their 
numerical findings indicated that such surfaces are astonishingly smooth: in two 
and higher dimensions, the fluctuations appeared to be at most logarithmic in the 
number of particles. 

In the more than two decades since, this finding has resisted mathematical proof. 
Diaconis and Fulton [DF91] identified IDLA as a special case of a "smash sum" 
operation on subsets of Z 2 . Lawler, Bramson and Griffeath [LBG92] proved that 
the asymptotic shape of the domain A(n) is a disk, and Lawler [Law95] showed that 
the fluctuations from circularity are at most of order r 1 / 3 up to logarithmic factors, 
where r = n/rr is the radius of the disk of area n. Later work [( 30, LP10] related 
internal DLA with multiple sources to variational problems in PDE. However, the 
basic issue raised by Meakin and Deutch and again by Lawler ( "A more interesting 
question... is whether the errors are o(n a ) for some a < 1/3." [Law95]) stood 
until Asselah and Gaudilliere [AG 10a] (with an appendix by Blachere) improved 
the bound of r 1 / 3 in dimensions d > 3 to r l /^ d+1 \ 
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Our main result shows that in dimension 2, the fluctuations from circularity are 
at most order logr. In fact we show somewhat more, namely that the probability 
that the fluctuations at time n = irr 2 exceed a logr decays like r~ 7 (where 7 can 
taken to be arbitrarily large, but the constant a may depend on 7). Let 

B r = {z £ I? I z\ + z\ < r 2 } 

be the set of lattice points lying in the open disk of radius r centered at 0. For 
convenience, we define the IDLA cluster A(t) for all t G K by setting A(t) = A([t\), 
where [t\ is the greatest integer < t (and A(t) = if t < 1). 

Theorem 1. For each 7 there exists an a = 0(7) < 00 such that for all sufficiently 
large r, 

p({B r _ alogr C A(irr 2 ) C B r+ak)gr .} c ) < r~\ 

The proof will show that 0(7) can be taken to be an affine function of 7. Taking 
C = 0(7) for large enough 7, Theorem 1 in particular implies (by Borel-Cantelli) 
that the fluctuations from circularity are O(logr) almost surely: 

P {B r _(7i g r C A(-irr 2 ) C B r+ ci ogr . for all sufficiently large r} = 1. (1) 

This is the first part of a three part article. In the second part, entitled Internal 
DLA in higher dimensions [JLS10], we show that internal DLA in 7L d for d > 3 
has even smaller fluctuations than in dimension 2: the higher-dimensional analogue 
of (1) holds with logr replaced by ylogr. We believe that these orders (logr in 
dimension 2 and ylogr in dimensions d > 3) are the best possible. We discuss 
a possible approach to proving matching lower bounds, as well as several other 
open questions, in Section 3. Using methods rather different from ours, Asselah 
and Gaudilliere [AG 10b] have recently announced an independently obtained upper 
bound of order log 2 r in all dimensions d > 2. 

In the third part, entitled Internal DLA and the Gaussian free field [JLS11], we 
show that the fluctuations from circularity have a weak limit which is a variant of 
the Gaussian free field. While both this paper and the sequel address fluctuations 
of IDLA, the results are of a rather different nature: the present paper is concerned 
with fluctuations at the fine scale of individual lattice points. The sequel addresses 
average fluctuations, and will show that in a certain sense the fluctuation from 
circularity averaged over a constant fraction of the boundary (here inner and outer 
fluctuations cancel each other out) is only constant order. In fact, the fluctuations 
from circularity converge in law to a particular Gaussian random distribution on 
the circle, whose time evolution is related to the Gaussian free field. 

2 Logarithmic fluctuations 

We begin in Section 2.1 with a lemma that rules out "thin tentacles" in the IDLA 
cluster (Lemma 2). More precisely, it is unlikely that z G A(n) and less than a 
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Figure 1: Top: IDLA cluster A(n) for n = 4 x 10 5 . Early points (see Section 2.2) 
are colored red, and late points are colored blue. Bottom: closeup of a portion of 
the boundary of the cluster. 



constant fraction bm 2 of lattice sites in the disk B(z, m) belong to A(n). Section 2.2 
establishes a rather rough a priori bound on the probability of certain types of very 
large fluctuations. Section 2.3 gives some large deviation results for the exit time of 
Brownian motion from an interval. These three sections comprise the background 
results needed for our argument. 

In order to control more precisely the deviation from circularity near a point 
( £ Z 2 , we define in Section 2.4 a discrete harmonic function : Z 2 — > K which 
approximates the harmonic function 




(2) 
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where z and £ are viewed as complex variables. (Discrete harmonicity will fail for 
Hq only at a few points near (.) The function H^{z) — ^ approximates the discrete 
Poisson kernel for the disk Bi^i, that is, the probability that simple random walk 
started at z first exits B|^| at £. We may extend the function Hq linearly to the 
grid Q of horizontal and vertical line segments joining vertices of Z 2 . If we begin 
with a collection of particles on the grid G, and we allow some of them to move 
according to Brownian motions on Q (see Section 2.5), according to determined 
starting and stopping rules, then the sum of over these particle locations is a 
continuous martingale (provided that no particle continues to move after reaching 
a point where discrete harmonicity fails). 




Figure 2: Level sets of F^. Note that = on 9B|^| and F^(0) = j^r. 

By the martingale representation theorem (see [RY05, Theorem V.1.6]), we can 
represent any continuous martingale as a time change of a standard Brownian mo- 
tion. Using the representation theorem and large deviations for Brownian exit times, 
we can control the likelihood that this martingale is larger or smaller than its mean 
by estimating its quadratic variation. 

We will ultimately show that large deviations of A(n) from circularity are un- 
likely without large deviations of the martingale for some £. For example, the fact 
that thin tentacles are unlikely (Lemma 2) means that with high probability, a point 
near some £ cannot belong to A(n) unless many points near £ belong to A(n). Since 
H^(z) is large for z close to £, this can be used to show that (when £ is outside the 
typical range of A(n)) the martingale is likely to be large if a point near £ belongs 
to A(n). Similarly, we can show that if £ fails to be hit (long after we expect it 
to have been hit) then the martingale has to be small at that time. (For this step 
we actually use a modified version of the process in which particles are frozen when 
they first exit a set called Vt^ that approximates B|^|, see Figure 5.) 

The martingale is a Brownian motion parameterized by its quadratic variation 
time, but the amount of quadratic variation time elapsed while A(n) is constructed is 
random. Note that the conditional expected amount of time elapsed when depositing 
one particle (given the locations of all previous particles) is the conditional variance 
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of the value of at the position that the particle is deposited. This value is 
easy to approximate on the event that A(n) is approximately circular. If, as a 
heuristic, we assume that A{n) is approximately circular for all n, then a back of 
the envelope calculation shows that the total time lapse in constructing j4(7t|£| 2 ) 
should be of order log|£| in dimension two (and of constant order in all higher 
dimensions). On the event that the total time lapse is about right, the probability 
of having a martingale fluctuation of C^log \ (\ standard deviations should decay 

like e~^ log '^ = |C| _C,2//2 - If C is large enough then we should be able to sum over 
all £ and show that we are unlikely to see fluctuations of this magnitude anywhere. 

This heuristic actually suggests the basic strategy of our argument: we assume 
some amount of regularity for the growth of A(n) and use that assumption to show 
that the quadratic variation time lapse is unlikely to be large, hence large martingale 
fluctuations are unlikely, and hence the growth of A(n) is even more regular than 
we assumed. This strategy is pursued in Sections 2.6 and 2.7, which contain the 
heart of proof. Using the ideas above, we will show that the presence of an "early 
point" (a point z visited at time less than vr|z| 2 ) implies, with high probability, the 
presence of a comparably late point at a previous time (Lemma 12); similarly, the 
presence of a late point implies, with high probability, the presence of a significantly 
earlier point at a previous time (Lemma 13). By iterating these two lemmas, we 
will see that if there were a substantial probability to have even a slightly early or 
late point (off by more than Clogr), then there would be a substantial probability 
of having a much larger fluctuation, which would contradict our a priori estimate. 

In general, we use Co, C\, . . . to denote large constants and cq,ci,... to denote 
small constants. Unless explicitly specified otherwise, these are all positive absolute 
constants. 

2.1 No thin tentacles 

Let A(n) be the internal DLA cluster formed from n particles started at the origin 
in Z 2 . For zGZ 2 and a positive real number m, let 

B(z, m) = {y G 1? : \y — z\ < m} 

where \z\ = (zf + z^) 1 ^ 2 is the Euclidean norm. 

The following lemma is a variant of [LP10, Lemma 5.12]. The basic estimates 
involved are similar to those used by Lawler, Bramson and Griffeath in their proof 
of the outer bound; see in particular [LBG92, Lemma 5b]. 

Lemma 2. There are positive absolute constants b, Co, and c$ such that for all real 
numbers m > and all z £ Z 2 with \z\ > m, 

F{zeA(n), #(A(n)nB(z,m)) <bm 2 } <C e- Com . (3) 
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In other words, it is unlikely that z belongs to the cluster A(n) but less than a 
constant fraction of sites in the ball B(#, m) belong to the cluster. Figure 3 depicts 
this unlikely scenario. 




Figure 3: A thin tentacle. 



In fact, the exponent on the right side of (3) can be improved to order to 2 / log m 
(and to order m 2 in dimensions d > 3). In appendix A.l, we prove these stronger 
bounds in all dimensions. We anticipate using these bounds for d > 3 in our sequel 
paper Internal DLA in higher dimensions. For purposes of the present paper, we 
will use only the bound (3) in dimension d = 2. 



2.2 A priori bound on probability of large fluctuations 

We expect a point z 6 Z 2 to first join the IDLA cluster A(n) at time about n = 7r\z\ 2 . 
This motivates the following two definitions, which are illustrated in Figure 4. 

Definition. For a positive real number m, we say that z £ Z 2 is m-early if z € 

.A(7r(|z| — m) 2 ). For a time N, let 

£m[N]:= |J {z € A(tt(\z\ - m) 2 )} 

zeA(N) 

be the event that some point in A(N) is m-early. 

Definition. For a positive real number £, we say that z G Z 2 is £-late if z ^ 

A(7r(|2| +f) 2 ). For a time TV, let 

CtW\= U {^^(N+^) 2 )} 

be the event that some point in B n—r- . is £-late. 

The next lemma gives the a priori estimate on the probability of very late points 
that we will use to get the proof started in Sections 2.6 and 2.7. It follows easily 
from Lemma 6 of [LBG92]; for completeness, we include a proof in appendix A. 2. 
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Figure 4: Illustration of an IDLA cluster A(n) (union of the black squares), an 
m-early point and an Mate point. The early point belongs to A{n) despite lying 
outside the disk B r+m , where r = y/n/ir; the late point does not belong to A(n) 
despite lying inside the disk B r _^. 



Lemma 3. There are absolute constants Co, cq > such that for all real £ > 0, 

F{£ e [100ir£ 2 }) <Coe~ Co1 . (4) 

To avoid referring to too many unimportant constants, we take Co large enough 
and Co small enough so that the same Co and cq work in Lemmas 2 and 3. 

To prove Theorem 1 we will show that for each 7 there is a constant a = 0(7) 
such that if r is sufficiently large and m = i = alogr and n = Trr 2 , then 

P(5 m (n)U <^ 7 - (5) 

The next lemma is an easy observation which shows that (5) implies Theorem 1. 
Lemma 4. For all £, m, N > we have 

n<N 

n<N 



s 



Proof. Taking complements in the definition, we obtain 
£ m [Nf= p| {z^A(7r(\z\-m) 2 )} 

zeA(N) 

= n n {z A(n)} 

zeA(N) n<min(Af,7r(|z|-m) 2 ) 
n ^ N z:\z\>J^+m 



Likewise, 



n<N 



C e [N] c = fl {^A(7r(\z\+e) 2 )} 



/N/it-e 

= n n 

n ^ z:\z\<y/nfn-t 

= n {v7-* cA(n) }- n 

n<7V 



2.3 Brownian exit times 

For x £ M, write E, x for the expectation on a probability space supporting a one- 
dimensional standard Brownian motion {B(s)} s >o started at -B(O) = x. For a, 6 > 0, 
let 

r(-a,b) = inf{s > | B(s) £ [-a, 6]} 
be the first exit time of Brownian motion from the interval [—a, b]. 
Lemma 5. Let < a <b and A > 0. // y/\(a + b) < 3, then 

Eoe Ar(-a,6) < j + 10Aa6 

Proof. Let c = (a + fe)/2, and for x G [— c, c] let 

/(x) = E x e AT (" c > c ) 

By translating the interval [—a, b] to [— c, c], it suffices to show that if \f\c < 3/2, 
then 

f(x) < l + 10A(c + x)(c-x). (6) 
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The function e xt f(x) obeys the heat equation in the strip [0, oo) x [— c, c], hence 

e xt f"(x) = -\e xt f(x). 
Since / is even and /(c) = 1, we conclude that 

cos 



cos 



(VAc)' 



Since cosx is concave in the interval [0, |], we have 

cos(V\x) < cos(\/Ac) + \/A(c — x) sin(\/Ac) 

which gives 

f(x) < 1 + V\(c - x) tan(VAc). 
Since tan# < 109 for E [0, |], we conclude that for x G [0,c], 

f(x) < 1 + 10Ac(c - x) < 1 + 10A(c + x)(c - x) 

which establishes (6). □ 

The following well-known estimate appears as Proposition II. 1.8 in [RY05]. 

Lemma 6. Let {B(s)} s >o be a standard Brownian motion with B(0) = 0. Then for 
any k, s > 

P \ sup B(s') > ks \ < e- k2s / 2 . 
l/e[o,«] J 

2.4 The discrete harmonic function Hq(z) 

Fix a lattice point ( S Z 2 . In this section we define the discrete harmonic function 
resembling in (2), which we will use to detect early points near £. 

For z £ Z 2 , let P n (z) be the probability that an re-step simple random walk 
started at the origin in Z 2 ends at z. The recurrent potential kernel for simple 
random walk in Z 2 is defined by 

oo 

g(z)=J2(Pn(0)-P n (z)). (7) 

n=0 

This sum is finite for all z£Z 2 , and its discrete Laplacian 

Ag(z) = 9(z + l)+9(z-l) + g(z + i ) +g (z- i ) _ ^ (g) 

vanishes for z / and equals 1 if z = 0. Here we have identified Z 2 with Z + iZ C C. 
Fukai and Uchiyama [FU96] and Kozma and Schreiber [KS04] develop an asymptotic 
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expansion for g(z). We will use only the following estimate: there are absolute 
constants A and Co (rendered explicit in [KS04]; see also [LL10, §4.4]) such that 

< C \z\- 2 . (9) 

We will use this sharp asymptotic estimate of the potential kernel to show that 
the discrete harmonic function defined below is close to the continuum harmonic 
function (Lemma 7c), which in turn gives a discrete mean value property: the 
average value of on the discrete disk B r is very close to H^(0) (Lemma 8c). 

Evidently, g(0) = and g is symmetric under the dihedral symmetries generated 
by the reflections across the axes x + iy \— > ±x±iy, and the diagonal, x + iy i— > y + ix. 
Moreover, g(z) > for all Exact values of g near the origin can be computed 

using the McCrea- Whipple algorithm, described in [KS04]. In particular, 

5 (1) = 1, g(l + i) = -. 

TT 

(We will use these exact values for convenience in specifying constants. But what 
we really need is not the actual values, but the property that g(l + i) > g(l), which 
is not surprising since 1 + i is farther from the origin than 1.) 

We construct the function for £ = x + iy in the sector S = {( | < y < x]. 
(For £ ^ S, we choose a dihedral symmetry <j) such that £ S, and define H^(z) := 
H^^z).) Let 

P=ICl 

and write 

C/p = ai + a 2 (l + i) 

with «2 = yip and a\ = {x — y)/p. The property we will use about the coefficients 
aj for j = 1, 2 is that they are positive and bounded. Indeed < ay < 1. 
Define 

TT 

H^z) = - [aig(z - ( - 1) + a 2 g(z - ( - 1 - i) - (ai + a 2 )g{z - Q\. 
Note that (using the symmetries) 

H c (O = ^[a ig (l) + a 2 g(l + i))>0 

H ( (C + 1) = |[a 25 (l) - (aa + a 2 )g(l)] < (10) 

F C (C + 1 + *) = |[ai^(l) - (ai + 0-2)5(1 + 01 < °- 

The last inequality follows from the fact that g(l + i) > g(l)- 
Extend linearly along each edge of the grid 

Q = {x + iy I x £ Z or y £ Z}. 



g(z) log |^| - A 

TT 
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Figure 5: Illustration of the set Of for £ = 6 + 41 The function is discrete 
harmonic except at the three points Q, £ + 1, and £ + 1 + «. Its value is positive at 
£ and negative at £ + 1 and £ + 1 + i. For general z G Q, we have H^{z) = KH^(w) 
where w is the first of these three points hit by a Brownian motion in Q started at z. 
Large green dots on the edges of Q represent points z £ Q for which flf(z) = l/2p. 
Together with £, these points form the boundary of Of. Small black dots indicate 
points in Z 2 n Of. For z E Of, the value H^(z) — l/2p is proportional to the 
probability that a Brownian motion in Q started at z first exits Of at Q. 



(By an edge of Q we mean a line segment of length 1 whose endpoints lie in Z 2 .) 
Now define Of as the connected component of the origin in the set 

{z e G - {Q : H c (z) > l/2p}. 

Recall the discrete Laplacian A on Z 2 , defined in (8). We say that a function / 
defined on a subset A C Q is grid-harmonic if it is continuous and 

• For each edge e of Q, if / C e n A is an interval, then / is linear on /. 

• For each vertex of Z 2 in the interior of A, the sum of the slopes of the four 
line segments starting at z (directed away from z) is zero. 

For example, this holds if A = Q and / is discrete harmonic on vertices and linear 
on edges. 
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Lemma 7. 



(a) is grid-harmonic on Qr, and > l/2p on £1^. 

(b) C € <9f^ ; and for all z € <9r^\{C} we Ziaue H^(z) = l/2p. 



(c) There is an absolute constant C\ < oo smc/j £/iai 



-2 



In particular, 



H C (0) - l/p\ < dp- 



id) 1 < H C (Q < 2. 



Proof. 



(a) By (10) we have i? f (C + 1) < l/2p and H ( (( + l + < l/2p, so the points 
C + 1 and C + 1 + * are n °t in Furthermore by definition C ^ O^. Since 
i?£ is discrete harmonic at all other lattice points, its linear extension is grid- 
harmonic in £7^. Consequently, the inequality > l/2p follows from the 
maximum principle. 

(b) This is immediate from the definition of Cl^. 

(c) The approximation for \z — C\ < 10 is trivial (choosing C\ sufficiently large). 
For \z — C| > 10, the bound (9) implies 



il f (2)-ai(log |z-C-l|-log |z-C|)-a 2 (log \z-C~l-i\-log \z-(\) 



O 




By the fundamental theorem of calculus 



ai(log \z - C — 1 1 - log \z - C|) + O2(log \z - C - 1 - i\ - log \z - C|) 



Z" 1 1 I" 1 1 + i 

/ Re dt — ao Re ; 

Jo z-C-t 2 J z-(-t(l + i) 

Re — — — - «2Re 1 + \ + ( -. — —. ) 
ar-C z-C \\z-(\ 2 J 



dt 




(d) Using the exact values g(l) = 1 and g(l + i) = 4/-7T, we have 



#c(0 = ^(«i + -« 2 ) = ^ 

Z 7T Z 



ai + 2a 2 
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with the constraints a« > and 

(ai + a 2 ) 2 + a\ = 1. 

Hence «i + a 2 < 1 and 

(7r/2)ai + 2a 2 < 2(ai + a 2 ) < 2. 

Thus -ff^(C) < 2. For the lower bound, one checks that aj and hence (ir/2)ai + 
2a 2 is a concave function of a 2 . At the extreme points (ai,a 2 ) = (1,0) 
and («i,a 2 ) = (0, l/v2), the values of H^(C) are 7r /2 and \/2- Therefore, 
#c(C) > > 1. □ 



A corollary of Lemma 7 is the following. 
Lemma 8. There is an absolute constant C 2 < oo such that 



(a) B p _c 2 C J\ C B p+ c 2 , where B r is the disk of radius 



r in 



(b) l/2p < H c (z) < l/(p - r - C 2 ) /or < r < p - C 2 and z £ B r . 

(c) For all < r < p, 



zeB r 



+ 



£ (2T f (*) - 2T f (0)) 

^en c nz 2 



< C 2 log p. 



Proof. 

(a) Recall that Lemma 7 (c) says 

Itf^-i^l^d/lz-CI 2 . 

We will show that 

1 C 



2p |z-Cp 



6 55 



P+2C 



C 



e 



p-2C- 



(11) 



(12) 



C " 2p ' \z-C\ 2 ' 
Since the level curve {z : F^{z) = l/2p} is the circle dB p \{(}, this confirms 
part (a) with C 2 = 2C\. 

To prove (11) and (12), note first that 



\VF c (z) 
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Indeed, by the Cauchy-Riemann equations, this is the modulus of the (com- 
plex) derivative of — z). Any curve from dB p to dB p+ c has length at 
least C. In particular following the integral curves vector field VF^ we see 
that F^ changes by at least 

C 

max z \ z — C| 2 

for z along the curve. These integral curves are the circles through £ centered 
on the tangent line to dB p at f, and \z — (\ is nearly constant along the portion 
of the integral curve between dB p and nearby concentric circles. Therefore the 
change in Fq is bounded as in (11), and similarly for (12). 

(b) The lower bound 1/2/3 holds for all z £ O^. As we saw in (a), the level curves of 
i*£ differ from level sets of by at most a fixed distance 2C\ . The supremum 
of F c in B r is F c ((r/p)£) = l/(p- r). 

(c) First suppose that r < p — 2C±. By Lemma 7(c), 

" • " - E W*) - W)) < E i — ^ + 

^eB r 



zeB r 



Ci 



< 8irCi log p 



z-(\< 



Let 5° be the union of unit squares around lattice points of B r . Since | Vi^ | < 
l/\z — £| 2 , we have (writing dz for area measure) 

f F c (z)dz-J2W <4 / -^<8vrlogp. 



^eB r 



Next, 



Moreover, 



F c (z)dz 



F c (z)dz 



< 2 



2tt 



7'f? 



1 

W _ 7 



rd9 < 8vr log p. (13) 



/ F ( (0)dz- £ F C (0) 



|vrr 2 - #B r |/p < lOr/p < 10. 



Combining all these inequalities, we have shown that 



£ (2T C (*) - ff<(0)) 
zeB r 



is within C2 log p of 



(F f (z)-F f (0))<fc = 0. 



(14) 
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The last equation (14) holds because is harmonic in B r . 

We now turn to the cases of Q.( and B r with p — 2C\ < r < p. This follows 
from the case Bp_2Cn which we have already proved. In fact, these other sums 
differ only on lattice sites in p — 2C\ < \z\ < p + 2Ci, and the sum over these 
sites is majorized the same way as in (13), 

52 \H ( (z)\< 52 min (^^' 4 ) <32vrC 1 logp. □ 

2eB p+ 2C 1 \ B p-2C 1 2; SB p+ 2C 1 \ B p-2C 1 

Remark. One can replace with harmonic functions for which Lemma 8(a) and (b) 
hold with B r replaced by shapes other than disks. However, the approximate mean 
value property Lemma 8(c) relies on B r being a disk. The main ingredient is the 
continuum mean value property (14), which can be rephrased as saying that har- 
monic measure on dB r from the origin is the same as from a uniform point in B r . 
Accordingly, the place where the approximate mean value property Lemma 8(c) is 
used marks a crucial step in the proof: The bound for B r is used in the proof of 
Lemma 12, and the bound for in the proof of Lemma 13. 

2.5 The martingale M c (t) 

For each integer n, recall that the set A{n) consists of n lattice points with A(n) C 
A(n + 1) growing according to the IDLA scheme. For £ G Z 2 , we will define a 
modified cluster A^(n) in which particles are stopped on exiting Q^; moreover, we 
will define A^(t) for all t G M, not just for integer values. 

To define the modified cluster, recall first the notion of Brownian motion (3(s) 
on the grid 

Q = {x + iy G C | x € Z or y G Z}. 

To construct /3(s), take a standard Brownian motion B(s) starting at at time 
so = until the first time s = s± that B(s) = ±1. For each open interval of the 
set Si = {s : < s < si, B(s) ^ 0} choose a direction (up, down, left, right) with 
equal probability 1/4, independently. Define a corresponding motion of a particle 
j3(s) = B(s) in an interval in which right was chosen (and f3(s) = B{s)i in an interval 
in which up was chosen, and similarly for left and down). On each interval except 
the final one, the particle returns to the origin. On the final interval it reaches 
a lattice point adjacent to the origin for the first time. We continue the process 
(centered at this new lattice point) until it hits a point adjacent to that one, etc. 

Let f3o(s), /3i(s), . . . be independent Brownian motions on the grid G, and for 
each integer n > 0, let 

< = inf {s > | P n {s) G (Z 2 - A ( (n)) U (0 - ft c )} 
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be the first time that /3 n visits either a lattice point outside the cluster A^(n), or 
any point outside the set f^. Then for < s < 1, define 

A c (n + s) = A c (n)U{f3 n+1 (s)} 

where 

Pn(a) = Pn Uin s* n jj , 0<s<l 

(and by definition, (3 n (l) = /3 n (s*)). The choice of is not important here; for 
our purposes, any increasing function f(s) satisfying /(0) = and f(s) t oo as s f 1 
would suffice in its place. Effectively, the particle moves at an ever-increasing rate 
so that it a.s. visits the set (Z 2 — A^{n)) U {Q — Q,A in less than unit time. 

The points of <9f^ where (eventually) many particles accumulate include (, but 
are typically non-lattice points on the grid Q. We regard A^(t) as a multiset: it may 
have points of multiplicity greater than one on the boundary 80,^, or at interior 
vertices at times when the moving particle is on top of an occupied site. When 
summing a function over Af(t), we take these multiplicities into account; thus for a 
function / defined on Cl^UdCl^ and t € [n, n + 1), the notation YLzeAc(t) f( z ) means > 

n-l 

E /W:=EM'(l)) + /(/3n(i-n)). 
zeA c (t) j=o 

Recall the function defined in Section 2.3. Because is grid-harmonic in 
(that is, discrete harmonic at lattice points, and linear on edges) , the process 

M c (i):= £ (# f (z)-tf c (0)) 

zeA c (t) 

(summing with multiplicity) is a continuous-time martingale adapted to the filtra- 
tion 

T t = <j{A^s) | < s < t}. 

We will refer to as the martingale with pole at £. 

By the martingale representation theorem of Dambis and Dubbins-Schwarz (see, 
e.g. [RY05, Theorem V.1.6]), 

M c (i) = B c (S c (t)) 

where B^(s) is a standard Brownian motion and S^(t) = (M^, M^)t is the quadratic 
variation of Mr. 

Note that S^(n) is a stopping time with respect to the filtration {^i>(s)}s>o, 
where T^(s) = inf{i | S^(t) > s}; and B^(s) is adapted to this filtration. By the 
strong Markov property ([MP10, Theorem 2.16]), for any n, the process 

B2(u) := B c (S c (n)+u)-B c (S c (n)) 
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is a standard Brownian motion that is independent of T n . It follows that we can 
write 

B c (s) = B c (S c (n)) + B?(s - S ( (n)), s G [S c (n), S c (n + 1)]. 

where B®(-), Bj-(-), . . . are independent standard Brownian motions started at 0. 
For -a < < b, let 

Tn (-a, b) = inf{u > | B%(u) <£ [-a, b]} 

be the first exit time of B™ from the interval [—a, b]. 
Lemma 9. Fix ( G 1? . Let 

-a n = min (H c (z) - H c (0)) 

b n = max (H c (z) - H c (0)). 

Then 

S{(n + 1) - S{(n) < T n (-a n , b n ). 
Proof. By the maximum principle, since is harmonic on A^(n) C f^, we have 

-a n < H c (z) - H c {0) < b n 
for all z G A^(n). Hence for all t G [n,n + 1], we have 

-a n < M c (t) - M c (n) < 6 n . 
That is, for all s G [^(n), ^(n + 1)], we have 

-a n < B c (s) - B c (S c (n)) < b n . 
That is, for u G [0, S^(n + 1) — S^(n)], we have 

-a n < B r c l (u) < b n . 

Hence the first exit of B n from the interval [— a n , b n ] occurs after time S^(n+ 1) — 
S^(n), which gives S^(n + 1) - S^(n) < T n (-a n , b n ). □ 

Fix m > 1. To detect the presence of m-early points in the aggregate A(t), we 
will use the martingales for £ lying just outside the aggregate. The next lemma 
helps control the elapsed time S^(t) on the event that there are no (m + l)-early 
points. Define 

r (t) = v / t7^ + 4m + 2C 2 . 
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Lemma 10. For all t > and all ( G Z 2 such that |£| > ro(t), 



E 



e s c (t) 1 



£m+l[t] c 



<t 40 . 



(15) 



Proof. Fix t, and abbreviate ro = ro(t). On the event £ m+ i[t] c , we have for all n < t 



A(n) C B 



n/V+m+l ' 



Hence by Lemma 8(b), on the event £ m -|_i[i] c we have 
< H c (z) - H C (Q) < 



±- < H c {z) - £T C (0) < _^ — — - 

2r r - y/n/n - m - 2 - C 2 



for all positive integers n <t and all z 6 vl^(n). 
We obtain from Lemma 9, 



where 



(S c (n) - S c (n - l))l £m+l[t] c < r n (-a, b n ) . 

1 , 1 

a = — , o n = , . 

2r r - y/n/ir - m - 2 - C 2 



Summing over n = 1 , . . . , t yields 



e m +i[t]< 



< e 



s cW 1 £ m+ i[*] c < e EUi T «(- a . 6 «)_ 



The Brownian exit times r n {— a, b n ) for n = 1, . . . , t are independent. Using Lemma 5 
with A = 1, we obtain 

t t t 



n=l 



n=l 



n=l 



Setting n = — m — 3 — C 2 we have 



E 6 " 



< 



(in 



— 70 ri - 7n7^ 
^ 2vrx 



r i - a; 



2vr 



ri 



n -y 



dy 



< 2nri log Tl r-r- - 27rr ° lo S 7T- 

ri — \/t/TT l>2 

Hence X^n=i 10a6 n < 107r log(ro/C2). We conclude that 



E 



e S ^l £m+l[tY 



<(r /C 2 ) Wn <t m . □ 
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In order to study late points in the cluster A(t), we will need a variant of 
Lemma 10 which helps control the elapsed time S^(t) when |£| is slightly smaller 
than ro (t) . In this case the quadratic variation of is larger due to particles that 
accumulate on the boundary <9r^. 

Lemma 11. Fix m > 2C 2 + 2 and £ < m. Fix ( G I?, and let t = vr(| CI + ■ If 
|C| > £, then 

(16) 



E 



< t 40 e 700m 



Proof. Let p = and suppose first that p > 5m. Let to = [^{p — 4m — 2C2 — 1) 2 J 
Then ro(to) < P> so by Lemma 10, 



E 



e Sdto) ls m[ to] 



< E 



A(*o) 



l£ m+1 [to] c 



< 4° < t 40 . 



(Here we have used the trivial inclusion £ m [to] c C £ m+ i[to] c .) 

Furthermore, because < 2 on all of d£l^, we have by Lemma 9 for n 
t + l,...,t 

S c (n)-S c (n-l)<r n (-j- p ,2). 
Hence, using the inclusion £ m [i] c C £ m [to] c , 



e m [t]' 



SM)1 



n=to+l 



By Lemma 5, we have logEe 7 "^ 2 p' 2 ^ < — for to < n < t, hence 

* 10 
V logEe r " <—(t-t )< 400m. 
P 

t=t +l r 

In the last inequality we have used that t < ir(p + m) 2 and m > 2C2 + 2. 

Since the exit times T n for n = to + 1, . . . , t are independent of one another and 
of Ft , we obtain 



logE 



e 5 c(*)i 



< logE 



+ V] log Ee Tn 

n=to+l 



< 40 log t + 400m. 

It remains to consider the case £ < p < 5m. In this case we take to = 0, obtaining 



logEe 5 < w < —t 
P 



Wir(p + £)< 
P 



< 40vrp < 700m. □ 
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2.6 Early points imply late points 

Now we begin the main part of the proof of Theorem 1. Fix 7 > 1. With constants 
b, Co and Co given by Lemma 2, and C2 given by Lemma 7, we set 

C 3 = max {(7 + 5 + log C )/c , (1000/6)C 2 } . 

(In particular, if T > 3 and m > C 3 logT, then C e- C ° m < T~^ +5 \) 
Recall the events 

£ m [T] = I some z G A(T) is m-early | 

and 

Ce[T] = I some z £ is £-late| 

defined in Section 2.2. 

Lemma 12. (Early points imply late points) Fix 7 > 1, T > 20 and m > CalogT. 
Ifi < (VlOOO)m, then 

F(£ m [T]nC e [T] c )<T-^ +1 \ 

In other words, with high probability either there is no m-early point by time T, 
or there is a (6/1000)m-late point by time T. 

Proof. The proof consists of two main estimates. First, if z G Z 2 is the first m-early 
point and joins the cluster at time i, then for a suitably chosen point £ near z, the 
quadratic variation S^(t) is small: equation (18) below. Second, if in addition there 
are no ^-late points by time t, then the martingale M^(t) is large: equation (19) 
below. To finish the proof we use the large deviation bound for Brownian exit times 
to show that these two events (small S^(t) and large M^{t)) are unlikely both to 
occur. 

Without loss of generality we assume that t = (6/1000)m. Note that £ m [T] is 
empty if m > T 1 / 2 , so we may also assume m < T 1 / 2 . For each integer t = 1,2, ... , T 
and each lattice point z, let 

Qz,t = {ze A(t) - A(t - 1)} D{ze A(n(\z\ - m) 2 )} n £ m [t - If 

be the event that z first joins the cluster at time t and z is m-early, but no previous 
point is m-early. Since A(T) C By, 

|J U Qz,t=£m[T}. (17) 
t<T 2SB T 

As in Lemma 10, we write 

ro = r (t) = \ftpK + 4m + 2C 2 - 
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Fix a lattice point z £ B(, and consider the unit square with lattice point corners 
containing rQz/\z\. Define £ := (o( z j t) to be the corner of that square farthest from 
the origin. Then 

ro < |CoCM)l < r + 2. 
By Lemma 10, we have for £ = (q(z, t) 

Hence by Markov's inequality, letting s = (2j + 100) log T, we have 

nSm+i [t] c n {S c (t) > s}) < T-^ +10 °) E e s ^)l £m+i[t]c < T -(27+60)_ 

On the event Q z ,ti there are no m-early points before time t; moreover, z is at 
unit distance from some point of A(t — 1), so z is not (m + l)-early. Therefore, 

Qz,t C £ m +i[t] c . 

and hence 

P (Qz,t n {S C (i) > s}) < T -^ +m \ (18) 
Next, we show that for Q = Qo{z, t), 

P ({M f (i) < (6/10)m} n £,[t] c n Q g>t ) < T^ +5 \ (19) 

In other words, on the event E := ££[i] c n Q z .t it is likely that M^(t) > (b/10)m. 

On the event Q z ,t, no z' G -A(t) is (m + l)-early, so we have \z'\ < y/i/7r + m + l < 
?*0 — C2 for all z' G In other words, 

A(t) cB ro _ C2 . 

Hence Lemma 8(a) implies that -Af(i) does not meet <9f^. In other words, on the 
event Q z t we have 

Ac(t) = A(t). 

Partition ^(i) into three disjoint sets as follows. 

A 1 = A c (t) n B ^_ e ; A 2 = A ( (t) D B(z, m); A 3 = A c (t) \ {A 1 U A 2 ). 

On the event E, the ball of radius \ftpK — £ is filled by A^(t), so 

and since A^(i) has i points, A 2 U ^3 has at most 10\/^ points. In particular, 
{#A 3 )/r < Wy/^£ and 

£ (H ( (z') - H C (0)) > -(#A 3 )H C (0) > -201 = -(6/50)m, 
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since H is non-negative on Q,^ and, by Lemma 7(c), H^(0) is close to 1/rp. Further- 
more, Lemma 8(c) implies 

(Hc(z') - H c (0)) > -C 2 logr > -C 2 logT > -(6/1000)m, 

where in the last step we have used that C 2 < 6C3/IOOO and CslogT < m. On the 
other hand, since \z\ > m, it follows from Lemma 2 that 

P({#^2 < bm 2 } n E) < C e- com < T~^ +5 \ 

Next, recall that on the event Q z ,t, the point z is m-early but not (m + l)-early, 
so that \/t/ir + m < \z\ < \/t/7r + m + 1. Moreover, £ is at distance roughly 
3m from z along the ray to z. F^(z) ~ l/3m, and using Lemma 7(c) we find 
that H^(z') > l/4m + 0(l/m 2 ) for all z' G B(z,m). Furthermore, H^(0) at l/r 
and tq > \Jtjis > 10m, so H^(z') — H^(0) > l/8m. Therefore, on the event 
{#A 2 > bm 2 } n E, we have 

£ (2T f (a>) - H C (0)) > (#^)^ > 6m 2 J- = (6/8)m. 

Summing the contributions to M^(t) from A\, A 2 and A3, we find that on the event 
{#A 2 > bm 2 } HE we have M ? (t) > (§ - ± - j^)m > (6/10)m. Hence 

{M c (t) < (6/10)m} C {#A 2 < fem 2 } U £ c . 

Intersecting with E, we obtain 

P({M f (t) < (6/10)m} n £/) < P({#A 2 < 6m 2 } nB)< T~ (7+5) 

which proves (19). 

Recall that s = (2j + 100) log T. Since m > C 3 logT we have (6/10)m > s. 
Using the large deviation bound for the maximum of Brownian motion, Lemma 6, 
we obtain 

P( {5 f (t) < s} n {M c (t) > (6/10)m} ) < P( {£ c (i) < s} n {B f (S c (f)) > s} ) 

< P J sup B c (s') > s \ 

[ S 'e[0,s] J 

< e ' s/2 = T" (7+50) . (20) 
Finally, we assemble these ingredients to bound 

T 

t [T]n£,pr] c ) = £ Y, nQ z ,tnc e [T] c ). 

t=l z£B T 
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Split each term in the sum on the right into three pieces, with £ = (,o( z i 0> as 
indicated below and majorize using (18), (19), and (20), respectively. We obtain 
with s = (27 + 100) log T, 

f{Qz,t n C e [T] c ) < F(Q Ztt n {5 f (t) > a}) 

+ P(Q, )t n {M f (t) < (6/10)m} n £<[i] c ) 

+ P({5 f (t) < s} n {M f (t) > (6/10)m» < 3T~^ + ^ 

Summing over z G By and i = 1, . . . , T, since T > 20 we conclude that 

F(£ m [T] n Ci[T] c ) < (2nT 2 ) ■ T ■ 3T~^ +5 ) < T^ +1 \ 

This completes the proof of Lemma 12. □ 

2.7 Late points imply early points 

Fix 7 > 1, and let 

C 4 = (7 + 3)(1500 + -) + + 2C 2 + 2. 

co c 

where C2 is the constant appearing in Lemma 8, and Cq and cq appear in Lemma 3. 

Lemma 13. (Late points imply early points) If £ > C4logT and m < £ 2 /C^logT , 
then 

F(C e [T}n£ m [T} c )<T-^ +1 \ 

In other words, with high probability, either there is no £-l&te point by time T, 
or there is an £ 2 jC^ logT-early point by time T. 

Proof. Without loss of generality we take m = £ 2 /C A \ogT. Note that since £ > 
C4 logT this ensures m > £. Let ( G Z 2 be such that |£| < y/T/n — £, and set 

p=\(\, T 1 = tt(p + £) 2 . 

Note that T x < T. Denote by L[(] = {( A(Ti)} the event that C is £-l&te. Then 

C e [T] = |J L[C]. 

ICI<v^>- £ 

We will use the martingale Mf(i) with pole at £. We first show that on the 
event L[Q, 

M c (Ti) < (21) 

To prove this, note first that by the maximum principle, the smallest value of 
on is 1/2/3 attained on the boundary dO,^. This boundary value is achieved 
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everywhere on d£l^ except at the point £, at which has a much larger value. 
On the event L[(], no particle reaches £ by time T\, so the values of at all 
boundary points reached at time T\ are 1/2/3, which is smaller than the interior 
values. Therefore, Mt(T\) is maximized if all the sites of are occupied, i.e., 
C A^(Tx). By Lemma 8(a), we have C B p+ c 2 , so 

Tt - #fl f > vr(p + if - tt(p + C 2 + if. 

Since £ > C^logT, the right side is at least irpi. Hence on the event L[(], we have 

M c (Ti) < (np£) (± - tf f (0)} + ( H d z ) ~ ^C(O))- (22) 

By Lemma 7(c), we have ^ — H^(0) = — ^- + 0{\), which shows that the first term 
on the right side of (22) is less than —31/2. By Lemma 8(c), the second term is at 
most C^logp < C^logT < £/2. This concludes the proof of (21). 

Now suppose that \(\ > £. Recall that m > £ > C4 logT and C4 > 1, so e m > T. 
By Lemma 11, 

E e S Cm)l^ [Ti]c < T 40 e 70 0m < e 740m (33) 

Letting s = 750m, we have by Markov's inequality 

P({5 c (Tx) > s}n£ m [T 1 ] c ) < e- s Ee s ^h £m[Tl]c < e~ 10m . 
Now by (21), 

F(L[() n 8 m [T] c ) < F(£ m [T] c n {S C (T X ) > s}) + P({5 f (7\) < s} n L[fl) 
< e " 10m + P{ < S C (T 1 ) < s, M c (Ti) < -£}. 

We use Lemma 6 to bound the second term. Since M^{t) = B^(S^(t)), where is 
a standard Brownian motion with B^(0) = 0, we have 



>{s f (ro < Sj M f (ri) < <p{ inf s c ( s ') 

L» '6[0,s] 



< -I \ < e 



-t 2 /2s 



We conclude that for all (GZ 2 such that |C| < \/Tj^ ~ 4 

P(£ m [T] c n L[C]) < e" 10m + e -^ 2 /i500m_ 

Since m > £ > (7 + 3) logT and £ 2 /1500m > (7 + 3) logT by hypothesis, each term 
on the right side is at most T~^ +3 \ Hence for £ < |£| < y/T/ir — £, we have 

F (£ m [T] c n L[(\) < 2T-^ +3 \ (24) 
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To take care of the case |£| < £, we use Lemma 3, which gives 

\J L[(] j = F(Ci[4tt£ 2 }) < C e- co£ < T-^ +3 \ 
Summing (24) over ( G B^^_ f — yields 

F(£ m [Tr n £,[T]) < P I (J L[C] J + £ nSnPT n L[C]) 

< r -(7+3) + ^-(7+2)^ 

This completes the proof of Lemma 13. □ 

The proof of the main result from Lemmas 12 and 13 is a routine iteration. We 
use the lemmas in their contrapositive direction (no early point implies no late point 
and no late point implies no early point) for progressively smaller values of m and 
£, starting from (4). 

Fix 7 > 1 and T > 20. Let i = ^/T/IOOtt. Then (4) imples 



P(A [T]) <C e 



-co^o 



If T is sufficiently large, say T > To (7), then the right side is less than T 
Now define 

m = (1000/6)4. 
Lemma 12 implies that if tuq > C3 logT, then 

F(£ mo [T]nC £o [T] c )<T-^ +1 \ 

so that 

P(£ mo [T]) <2T-(t +1 ). 
For k > 1, define 4 and recursively by 

4 = \J (C 4 logT)m fe _i 
m fc = (1000/6)4- 

By Lemma 13, if l\ > C4logT, then 

p(£,JT]n£: mo [T] c ) <r-(^ +1 ) 

hence 

P(£ £l [T]) < 3T-^ +1 l 
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By induction on k, we find that 

P(£ 4 [T]) < (2A; + l)T-^ +1 ) 

and 

P(£ m jT]) < (2fc + 2)T-^ +1 ) 

provided rrik > C3 log T and 4 > C4 log T. 
The recursion gives 

4 = a^-Y^ 

and 

l-2" fe l/2 fe 

rut = a m Q . 

where a = ( 1000/6) C 4 log T. 

Let C 5 = (1000/6) max(C 3 ,C 4 ). Then 

4,™ fc <(C 5 logT)T 1 / 2fe . 

For k > (log log T)/ log 2, we have 

r l/2* < 3, 

Therefore, if we choose largest so that 4 > SCslogT, then 

k < (log log T)/ log 2. 

Thus, in our final step we may use 

I := max(4 + i, C 4 log T) < 3C 5 log T 
m := max(mfc+i, C4 log T) < 3C5 log T 

to obtain 

P(£ £ [T]) < (2k + 3)T~( 7+1 ) < ^T~ 7 

P(£ m [T]) < (2fc + 4)T-(t +1 ) < 

for all sufficiently large T. 

Let r = \/T/tt and a = lOCslogr. Then a > SCslogT > £,m. By Lemma 4, 
we conclude that 

P{B r _ a i ogr C A(Trr 2 ) C B r+alogr } c < P{B r _ £ £ A(T)} + P{A(T) £ B r+m } 

<P0Q[T]) + P(£ m [T]) 
< r-T < r~ 7 . 

This completes the proof of Theorem 1. 
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d = 1 


d = 2 


d> 3 


Order of h(x) at typical x S A 


sfn 


•/log n 


1 


Order of max xg A M 2 -) 


sfn 


log n 


\/logn 


Order of A _1 YlxtA ^>{x/n)h{x) 


\/n 


1 


n l-(i/2 



Table 1: Order of various kinds of fluctuations for the discrete Gaussian free field 
in Z d . In the bottom row, (p : [—1, l] d — > M. is a smooth test function. 

3 Concluding Remarks 

We mention here a few open questions and possible extensions of our results, along 
with a brief overview of our forthcoming sequels. 

Gaussian free field 

Let A = [—n,n] d C Z rf and let h be an instance of the discrete Gaussian free field 
on A with zero boundary conditions. That is 

1. write 

E(f)= E \f{x)-f(y)\\ 

x,ydA,\x— y\=l 

2. let H be the (2n — l) d -dimensional vector space of real-valued functions on A 
that vanish on the boundary <9A, 

3. and choose h from the measure e~ E ( h ^ 2 dh (normalized to be a probability 
measure), where dh is Lebesgue measure on H. 

Table 1 gives the orders of various observables associated to h. (See [BZ10] for 
recent results on the maximum of h in two dimensions, and [Sh07] for a general 
survey on the GFF.) Perhaps surprisingly, the orders in the last two rows do not 
change if we replace A with a fixed co-dimension one subset of A. This shows that 
weighted averages of h are remarkably well concentrated (and become more con- 
centrated in higher dimensions). Appropriately normalized, they scale to Gaussian 
random variables with an asymptotic variance that depends <f>. 

Next, if we let 

r . . /minjnlz G A(n)\ 

L & = V n N 

represent the "lateness" of the vertex z, then we expect L and its higher dimensional 
analogues to behave similarly to h in all dimensions. (This is trivial for dimension 
d = 1.) Our sequel paper, Internal DLA and the Gaussian free field [JLS11] ad- 
dresses the bottom row (for certain test functions 4>) and shows that L has a variant 
of the continuum Gaussian free field as a scaling limit. 
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Lower bounds on fluctuations 

The present paper established the upper bound of logn for the fluctuations of L 
corresponding to the middle square of Table 1. Simulations [MMOO, FL10] indicate 
that fluctuations of this order are really present, and proving this is a very natural 
open problem. One possible approach would be to use the methods of this paper 
to show that for small enough c, the expected number of aberrations of size c log n 
(or c-\/log n in dimension d > 3) in A(n) grows like a positive power of n — and 
then using a second moment estimate to show that infinitely many such aberrations 
occur almost surely. The main new ingredient required would be to control the 
correlations between fluctuations in two different directions. 

Higher dimensions 

The d > 3 sequel to this paper [JLS10] uses variants of the arguments presented 
here. The function is replaced by a discrete Green's function for a ball in Z rf 
(with zero boundary conditions). The obvious analogue of (constructed from 
differences of Z rf Green's functions) does not have spherical level surfaces. This 
causes some extra complication in the estimates involved in Lemmas 7 and 8. It 
becomes necessary to estimate the discrete Green's function on the ball directly, 
instead of building on known estimates for the Z d Green's function. The argument 
requires a few other modifications as well (for example, taking £ to be a point inside 
the ball, instead of on the boundary). 

Fluctuations in a fixed direction 

The bound of order logr in Theorem 1 measures the "fluctuations in the worst 
direction." Another interesting quantity is the fluctuation in a specific direction, 
say along positive x-axis, as measured by the differences 

max{i > | (x, 0) E A(-irr 2 )} — r 

and 

r - minjx > | (x, 0) £ A(nr 2 )}. 

Based on the analogy with the discrete Gaussian free field, we believe that these 
random variables have standard deviation of order ylogr in two dimensions, and 
that their higher dimensional analogues have O(l) standard deviation. These orders 
match those in the top line of Table 1. 

In numerical experiments on a related quantity, Meakin and Deutch [MD86] 
found order ylogr fluctuations in dimension 2 and O(l) fluctuations in dimension 3. 

We have not been able to adapt our bootstrapping argument to prove bounds of 
this type on fluctuations in a fixed direction. An m-early point where m = ydog r 
contributes only yTogr to the martingale M^, which is swamped by the logarithmic 
error term in the discrete mean value property, Lemma 8(c). 
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Derandomized aggregation 

James Propp around 2001 proposed a growth process called rotor-router aggrega- 
tion, in which particles started at the origin in Z 2 perform deterministic analogues of 
random walks until reaching an unoccupied site. Large-scale simulations show that 
the aggregates produced in this way are substantially smoother than IDLA, with 
even smaller fluctuations from circularity [FL10]. Indeed, part of Propp's motiva- 
tion was to study how much of the random fluctuation can be removed by passing to 
the deterministic analogue. The data indicate that the deviation from circularity in 
rotor-router aggregation may even be bounded independent of the size of the cluster; 
that is, B r _c C A(iTr 2 ) C B r+ c for an absolute constant C. Yet the current best 
upper bounds (order logr on the inside and r 1 / 2 logr on the outside, [LP09]) remain 
distant from this goal. In trying to adapt our methods to rotor-router aggregation, 
we can define a deterministic analogue of the martingale M^(t) by summing the 
function Hr over the rotor-router aggregate A(t). At this point an interesting ques- 
tion arises: what should take the place of the martingale representation theorem for 
this deterministic process? 

External DLA 

External DLA, defined by Witten and Sander [WS81], produces intricate branching 
structures instead of a smooth spherical surface. Typically it is defined with particles 
starting from infinity, but there is also a version in which particles start at the origin. 
In this version, the initial occupied set ^4(0) is the set of sites outside some extremely 
large origin-centered ball. At time n we start a random walk at the origin and stop 
when it first reaches a site z n adjacent to A(n). Then we set A(n + 1) = A(n)L){z n }. 

Both this paper and the sequel (about the Gaussian free field) use a similar 
technique: namely, they both use geometric arguments to control the quadratic 
variation of continuous martingales obtained by summing the values of discrete 
harmonic functions over sets of particle locations. One intriguing aspect of this work 
is that all of the martingales we utilize are still martingales for external DLA (grown 
from the origin). Indeed, Kesten's upper bound on the growth rate of external 
DLA [Kes90] uses a closely related martingale. 

Because the geometry of external DLA is much more intricate, however, it be- 
comes more difficult to control the quadratic variation of the martingales. In our 
argument, the interlinked inner and outer estimates — indeed, even the notions of 
early and late points — rely on the fact that the cluster fluctuates around a pre- 
dictable asymptotic shape (the disk). In external DLA, on the other hand, even the 
large-scale geometry is random. 
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A Appendix: Proof of Lemmas 2 and 3 



A.l Bounds on the probability of thin tentacles in all dimensions 

This appendix is devoted to proving the following lemma, which strengthens Lemma 2. 
Since we anticipate using this result to bound the fluctuations of internal DLA in 
higher dimensions as well, we treat all dimensions d > 2. 

Lemma A. In dimension d = 2, there are positive absolute constants b, Co, and c 
such that for all real numbers m > and all z£Z 2 with B(z, m), 

F{zeA(n), #(A(n)r\B(z,m)) <bm 2 } <C e- cm2/logm . (25) 

For each dimension d > 3, there are positive absolute constants b, Co, and c 
(depending only on d) such that for all real numbers m > and all z G Z 2 with 
B(z,m), 

P{z£i(n), #(A(n) nB(z,m)) < 6m d } < C e" cm2 . (26) 

Note that even the thinnest possible tentacle, a path of length m, can be formed 
using only m(m + l)/2 random walk steps: for j = 1, . . . ,m, the j-th particle to 
enter B(z,m) takes j steps along the path and occupies the next site on the path. 
This shows that the exponent m? on the right side of (26) is best possible. 

Instead of working with the ball B(z,m) it will be slightly more convenient to 
use the box B(z, m) := z + [—m, m] d C "L d . Since the volume of a ball is (up to a d 
dependent constant factor) the same as that of a box of the same radius, Lemma A 
holds as stated if and only if it holds with ~B(z, m) replaced by B(z, m). 

Fix z 6 7L d , and for < j < m let Sj be the set of points x = (x\, . . . , Xd) € Z d 
such that 

max \Xi — Zi\ = m — j. 
i=i 

The sets Sj partition B(z, m) into m + 1 concentric shells, indexed from the outside 
inward. Since 7?\Sj is disconnected, at least one site in shell Sj must join the 
cluster A(n) before any site in Sj-i can join. 

It is enough to show that (25) and (26) hold if we replace A{n) with the set A, 
where A = A(n z )DB(z, m), and n z is defined to be the smallest n for which z E A(n). 
Now write aj := #(A n Sj) for j £ {0,1, ... , m}. The cluster A(n z ) intersects each 
shell Sj, so a,,- > 1 for all j = 0, 1, . . . , m. 

Given m, the number of possibilities for the sequence aj is at most ((2m + l) d ) m , 
which is much smaller than the reciprocal of the right side of (25) and (26). It thus 
suffices enough to show that, for each fixed sequence a = (aj^jLo with aj > 1 and 
^Jlo a j — bm d , (25) and (26) hold when the left side is replaced by the probability 
of the event 

m 

T a :=f}{#(AnS J )=aj}. 

j=0 
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We emphasize that Co and c are still absolute constants; in particular, they do 
not depend on the sequence a or on m. We will henceforth assume that such a 
sequence a has been fixed, and bound the probability P(7I). 




Figure 6: On the left: a possible value for A (black, with z in red at 
the center). Here m = 12 and the sequence ao, a\, . . . , a\i (giving number of 
black squares in the square shell Sj, indexed from outside inward) is given by 
4, 1, 2, 2, 2, 1, 2, 1, 1, 1, 3, 3, 1. Taking d = 1/2, the corresponding sequence bo, . . . , bu 
is given by 5, 5, 5, 5, 5, 3, 3, 3, 2, 2, 3, 3, 3, and a corresponding tower of squares is il- 
lustrated on the right. The sizes of the squares are fa = 5, fa = 3, fa = 2, fa = 3. 
The grey shells indicate the S ai , where ao = 0, a\ = 5, 02 = 8, and 03 = 10. Since 
d = 1/2, the number of black squares in the "annulus" formed by the union of the 
Sj crossing a given square on the right (other than the innermost square) is always 
at least twice the number black of squares in the corresponding annulus on the left. 

Now we will construct a related sequence bj indexed by [0, m] with two nice 
properties: 

1. bj is roughly the kind of sequence a,j that one would see if A were a tower 
of cubes. More precisely, let fa,..., fa be positive integers summing to m, 
representing the side lengths of the cubes. For i E {0, . . . , k}, let 014 = fa+- ■ • + 
fa. For j G {0, . . . , m}, let 7j be the unique integer i such that aj_i < j < on. 
Then set 

6i = (^)*" 1 . 
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Note tiiat££o&i = £?=!#• 

2. There is a small constant c' such that for each i = 1, . . . , k — 1 (but possibly 
not for i = k) 

c'(A/2) d < aj <</(&)*. 

Once we have fixed c', we can construct the sequence 6j from Oj in a deterministic 
way by induction: Given the cube side lengths (3\, . . . , /3j-i, take (3i to be the smallest 
possible such that the second condition holds. 

Next we make explicit our choice of d . For x£Z d , write F x for the probability 
measure associated to simple random walk (X(t))%2. in Z rf with X(0) = x. By the 
local central limit theorem [LL10, §2], if t is even, then for each site y of even parity 
m the ball B(x,Vi), we have 

F x (X(t) =y)> c t- d / 2 

for a constant Co > depending only on d. Let c\ = 0^2 ~ 5 eo, where uid is the 
volume of the unit ball in 

By [LL10, Prop. 2.1.2], there is a constant C2 > depending only on d, so that 

P \ max \X(s)\ > Vt/c 2 \ < c x . (27) 

|^0<s<t J 

Now take d = Wrf2 _d_4 C2 in the above construction of the sequence bj from dj. This 
defines a sequence of shells S ai as in Figure 6. According to the next lemma, if A' is 
any subset of Z d satisfying A' n Sj < cij for j = 0, . . . , m, then simple random walk 
started from any point in the i-th shell S ai has at least a constant c\ > probability 
of exiting A' before reaching S' ai _ 1 U S a . +1 . 

Lemma 14. Let A' be any set for which #(A' n Sj) < aj for each j = 0, . . . , m. 
Fix 1 < i < k and x S S ai . For simple random walk in 7L d started at x, let T be the 
first hitting time S ai _ 1 U S ai+1 , and let T' be the first hitting time of (A') c . Then 

F X {T>T'} >ci. 

Proof. Let h = min(/3j, and let t be the greatest even integer < (c2h) 2 . Let 

r = C2/1. By the local central limit theorem, for each site y of even parity in the ball 
B(x, r), we have 

F x (X(t) = y) > c r- d . 

Suppose first that < /3j. Let B be the set of points of even parity in the 
intersection 

OLi+l-l 

B(x,r)n |J Sj. 

0=oti 
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Note that #B > #B(x, r)/2 d+2 > (uj d 2- d - 3 )r d . Hence by our choice of d , 
#(A'nB)< aj<<fP&i = tfh d <#B/2. 

j=ati 

Hence 

F x {X(t)£A'}> V x (X(t) = y)>(#B/2)^>2 Cl . 

yeBn(A') c 

By (27), we have 

f x (t < t) < c x . 

We conclude that 

F{T > T'} > F X {T > t, X(t) $ A'} 

> 1 - F X {T <t}- F{X(t) £ A'} 

> 1 - ci - (1 - 2ci). 

The proof in the case /3j+i > is identical, using the shells \J"= a . 1+ i Sj in the 
definition of the set B. □ 

During the formation of the IDLA cluster A, each time a particle hits a shell S ai 
(1 < i < k) for the first time, we begin a "trial" which ends either when the particle 
stops walking (due to exiting the cluster and occupying a previously unoccupied 
site) or when it next hits S CH _ 1 U S ai+1 . In the former case, we call the trial a 
"failure." Otherwise, we call it a "success." 

Let F s be the event that the s-th trial fails, and let Q s be the a- field generated 
by all random walks generating the IDLA cluster up until the start of the s-th 
trial. Let A s be the occupied cluster at the time when the s-th trial starts, and let 
E s = n^Lo{^(^ s ^ *%) — a j}- By Lemma 14, we have 

F(F s \g s )i Es > Cl i Es . 

In other words, on the event E s , once the s-th trial begins (and we condition on 
the past of the process — in particular on all previous trials) the trial fails with 
probability at least probability c\. 

Since each failure adds a new site to A, there are at most j^A failures. Next we 
estimate the total number of trials. Each particle that makes it to S ai has to first 
pass through all the S a _, for i' < i. Thus the process of successful growing a set A, 
such that #(A n Sj) = a,j for all j, necessarily includes at least T = Yl a j7j trials. 
Hence on the event T a , there are at least T trials, at most Yl a,- of which fail; and 
conditional on all past trials, each trial fails with probability at least c\. 

Before we put an explicit bound on T, we'd like to find a way to reduce the 
problem to the case that 

#A = aj < ciT/2 (28) 
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(i.e., failure is required to occur at less than half the expected rate). In other words, 
we'd like to reduce the case that Yli a jlj ls larger (by a fixed constant factor 2/ci) 
than a j- This will certainly be the case, for example, if more than a fraction 1/4 
of the sum ^ aj comes from values of j for which jj > 8/ci . 

Let R be a fixed integer greater than 8/ci . Let I be the smallest positive integer 
such that 

m in 

Y a 3 < 4 Y a r 

3=oiir j=am+l 

Then consider the smaller box we get by replacing mbym- am. We can take b 
small enough so that f3f is at most an arbitrarily small constant times m d , and 
hence (3i is at most some arbitrarily small constant times m. Indeed, since R is just 
a constant, we can take b small enough so that A ^ s a ^ mos t an arbitrarily 

small constant times m. Since the sequence 

a (i+l)R~ l 

Y a i 

3= a iR 

decreases by a factor of at least 4 each time % increases by 1 (up until i = I), we 
also obtain that the sequence 

PiR H h P(i+i)R-i 

is also bounded above by a sequence that decreases exponentially in i, and hence (by 
making b small enough) we can arrange so that Yli=i Pi ^ s less than m/2. Then if we 
replace m with m — am (and replace A by its intersection with the corresponding 
smaller box), we reduce the problem to one for which (28) holds and m has decreased 
by at most a factor of 2. 

Now assuming (28), the usual large deviations bound (Cramer's theorem) im- 
plies that the probability of seeing T trials with only c\T/2 or fewer failures is 
exponentially unlikely in T. Note that T = X]j=o a 3~fj * s U P ^° a multiplicative con- 
stant (between d /2 d and d) the same as Y^j=o ^'7j = Xa=i iPf- Hence the proof of 
Lemma A.l is completed by the following lower bound on ^ifif. 

Lemma 15. Fix d > 2 and m > 1, and consider an integer k and sequence of real 
numbers @i, fa, ■ ■ ■ , Pk f or which the following hold: 

1- Pi > 1 for allie {l,...,k}. 

2- £?=i& = m + l. 

3. E{(3) := Y2i=i ifif * s minimal among sequences satisfying the two conditions 
above. 
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There is a constant c(d) > 0, depending only on d, such that 

(E(/3) > c(d)m 2 / log m, d = 2; 



(29) 

E(f3) > c(d)m 2 , d>3. 



Proof. The first two conditions imply that k < m + 1. By Holder's inequality, 

V d / k \ W 

m 



k / k \ ± / u /k \ 

i=l \i=l / \i=l / 



where d' is such that l/d+ 1/d' = 1. Hence 

-d/d' 

E{p) > (m + l) d ' 



vi=l / 



If d > 2, then d'/d < 1 and the right side is at least a constant times m d k^ 1 ^~d ^^d r ^ = 
m d k 2 ~ d > m 2 . In the case d = 2, we have d'/d = 1 and the right side is at least a 
constant times m 2 (log/c) _1 > m 2 (log m) -1 . □ 

A. 2 Bound on the probability of very late points 

In this section we prove Lemma 3. Lawler, Bramson and Griffeath give an expo- 
nential bound on the probability of £-late points in the case that t is comparable to 
the radius of the cluster. Specifically, they show (see Lemma 6 of [LBG92] and the 
displayed equation immediately following it) that for any e > there is a constant C2 
depending on e such that for all z G Bn_ e u 

P(z <£ A((l + e)im 2 )) < 2e~ C2n . (30) 

Fix r > 0. Taking e' = e/2 and n = (1 — e')r in (30), we have (1 — e)r < (1 — e')n 
and 7rr 2 > (1 + e')7rn 2 , so for all z G B( 1 _ e ) r , 

P (z ^(vrr 2 )) < 2e~ C2n < 2e~ cr (31) 

where c is a constant depending on e. 
By definition, 

£,[100vr£ 2 ]= [J {zi A(tt(\z\ + £) 2 )} . 

Taking r = |z| + £ in (31) with e = A, we have for all z G Bg^ 

P{z ^ ^(^r 2 )} < 2e" cr < 2e" rf . 
Summing over z G Bg^ we obtain 

P(^[100t^ 2 ]) < 200vr£V c£ . 
The right side is at most Coe~ c ° e for suitable constants Co, cq > 0, which proves (4). 
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